Multimodal Summarization of User-Generated Videos

Authors

Abstract

The exponential growth of user-generated content has increased the need for efficient video summarization schemes. However, most approaches underestimate the power of aural features, while they are designed to work mainly on commercial/professional videos. In this work, we present an approach that uses both aural and visual features in order to create video summaries from user-generated videos. Our approach produces dynamic video summaries, that is, summaries comprising the most “important” parts of the original video, which are arranged so as to preserve their temporal order. We use supervised knowledge from both of the aforementioned modalities to train a binary classifier, which learns to recognize the important parts of videos. Moreover, we present a novel user-generated dataset which contains videos from several categories. Every 1 s part of each video in our dataset has been annotated by more than three annotators as being important or not. We evaluate our approach using several classification strategies based on audio, visual, and fused features. Our experimental results illustrate the potential of our approach.
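
The pipeline outlined in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function names (`majority_label`, `fuse_features`, `build_summary`) are hypothetical, the strict-majority rule for combining annotator votes is assumed, feature fusion is shown as simple concatenation, and the per-second importance scores are taken as given from any binary classifier rather than trained here.

```python
from typing import List, Sequence, Tuple


def majority_label(votes: Sequence[int]) -> int:
    """Combine binary annotator votes for one 1 s segment.

    Assumed rule: the segment is 'important' (1) only on a strict majority.
    """
    return 1 if sum(votes) * 2 > len(votes) else 0


def fuse_features(audio: Sequence[float], visual: Sequence[float]) -> List[float]:
    """Early fusion (assumed): concatenate audio and visual feature vectors."""
    return list(audio) + list(visual)


def build_summary(scores: Sequence[float], threshold: float = 0.5) -> List[Tuple[int, int]]:
    """Turn per-second importance scores into a dynamic summary.

    Returns (start, end) intervals in seconds, end exclusive. Intervals are
    emitted in increasing time, so the original temporal order is preserved.
    """
    intervals: List[Tuple[int, int]] = []
    start = None
    for t, score in enumerate(scores):
        if score >= threshold:
            if start is None:
                start = t  # open a new run of important seconds
        elif start is not None:
            intervals.append((start, t))  # close the current run
            start = None
    if start is not None:
        intervals.append((start, len(scores)))
    return intervals


if __name__ == "__main__":
    # Hypothetical scores for a 5 s clip, one per 1 s segment.
    print(build_summary([0.1, 0.8, 0.9, 0.2, 0.7]))
```

Merging consecutive important seconds into intervals keeps the summary as a small number of contiguous clips rather than isolated one-second fragments; the 0.5 threshold is an arbitrary default for the sketch.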

Similar resources

Multimodal Semantics Extraction from User-Generated Videos

User-generated video content has grown tremendously fast to the point of outpacing professional content creation. In this work we develop methods that analyze contextual information of multiple user-generated videos in order to obtain semantic information about public happenings (e.g., sport and live music events) being recorded in these videos. One of the key contributions of this work is a jo...

Fast Summarization of User-Generated Videos Using Semantic, Emotional and Quality Clues

This paper introduces a novel approach for fast summarization of user-generated videos (UGV). Different from other types of videos where the semantic contents may vary greatly over time, most UGVs contain only a single shot with relatively consistent high-level semantics and emotional content. Therefore, a few representative segments are generally sufficient for a summary, which can be selected...

Summarization of ICU Patient Motion from Multimodal Multiview Videos

Clinical observations indicate that, during critical care in hospitals, patients' sleep positioning and motion affect recovery. Unfortunately, there is no formal medical protocol to record, quantify, and analyze patient motion. There is a small number of clinical studies, which use manual analysis of sleep poses and motion recordings to support medical benefits of patient positioning and moti...

Predicting Emotions in User-Generated Videos

User-generated video collections have been expanding rapidly in recent years, and systems for automatic analysis of these collections are in high demand. While extensive research efforts have been devoted to recognizing semantics like “birthday party” and “skiing”, few attempts have been made to understand the emotions carried by the videos, e.g., “joy” and “sadness”. In this paper, we propose a ...

Title Generation for User Generated Videos

A great video title describes the most salient event compactly and captures the viewer’s attention. In contrast, video captioning tends to generate sentences that describe the video as a whole. Although generating a video title automatically is a very useful task, it is much less addressed than video captioning. We address video title generation for the first time by proposing two methods that ...

Journal

Journal title: Applied Sciences

Year: 2021

ISSN: 2076-3417

DOI: https://doi.org/10.3390/app11115260